Picture for Qi Cao

Qi Cao

DAJ: Data-Reweighted LLM Judge for Test-Time Scaling in Code Generation

Add code
Jan 29, 2026
Viaarxiv icon

FunPRM: Function-as-Step Process Reward Model with Meta Reward Correction for Code Generation

Add code
Jan 29, 2026
Viaarxiv icon

Models Under SCOPE: Scalable and Controllable Routing via Pre-hoc Reasoning

Add code
Jan 29, 2026
Viaarxiv icon

DreamPRM-Code: Function-as-Step Process Reward Model with Label Correction for LLM Coding

Add code
Dec 17, 2025
Figure 1 for DreamPRM-Code: Function-as-Step Process Reward Model with Label Correction for LLM Coding
Figure 2 for DreamPRM-Code: Function-as-Step Process Reward Model with Label Correction for LLM Coding
Figure 3 for DreamPRM-Code: Function-as-Step Process Reward Model with Label Correction for LLM Coding
Viaarxiv icon

AsarRec: Adaptive Sequential Augmentation for Robust Self-supervised Sequential Recommendation

Add code
Dec 16, 2025
Figure 1 for AsarRec: Adaptive Sequential Augmentation for Robust Self-supervised Sequential Recommendation
Figure 2 for AsarRec: Adaptive Sequential Augmentation for Robust Self-supervised Sequential Recommendation
Figure 3 for AsarRec: Adaptive Sequential Augmentation for Robust Self-supervised Sequential Recommendation
Figure 4 for AsarRec: Adaptive Sequential Augmentation for Robust Self-supervised Sequential Recommendation
Viaarxiv icon

GoalRank: Group-Relative Optimization for a Large Ranking Model

Add code
Sep 26, 2025
Viaarxiv icon

Fine-tuning Done Right in Model Editing

Add code
Sep 26, 2025
Figure 1 for Fine-tuning Done Right in Model Editing
Figure 2 for Fine-tuning Done Right in Model Editing
Figure 3 for Fine-tuning Done Right in Model Editing
Figure 4 for Fine-tuning Done Right in Model Editing
Viaarxiv icon

DreamPRM: Domain-Reweighted Process Reward Model for Multimodal Reasoning

Add code
May 26, 2025
Viaarxiv icon

Too Consistent to Detect: A Study of Self-Consistent Errors in LLMs

Add code
May 23, 2025
Viaarxiv icon

The 1st Workshop on Human-Centered Recommender Systems

Add code
Nov 22, 2024
Figure 1 for The 1st Workshop on Human-Centered Recommender Systems
Viaarxiv icon